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Abstract — In this paper, we study polar codes from a practi- 
cal point of view. In particular, we study concatenated polar 
codes and rate-compatible polar codes. First, we propose a 
concatenation scheme including polar codes and Low-Density 
Parity-Check (LDPC) codes. We will show that our proposed 
scheme outperforms conventional concatenation schemes formed 
by LDPC and Reed-Solomon (RS) codes. We then study two 
rate-compatible coding schemes using polar codes. We will see 
that polar codes can be designed as universally capacity achieving 
rate-compatible codes over a set of physically degraded channels. 
We also study the effect of puncturing on polar codes to design 
rate-compatible codes. 

Index Terms — Polar Codes, Rate-Compatible Codes, Concate- 
nated Codes. 



I. Introduction 

Polar codes, recently proposed by Arikan, are the first 
class of provably capacity achieving codes for symmetric 
binary-input discrete memoryless channels (BDMC) with low 
encoding and decoding complexity [1]. Since Arikan's paper, 
there have been many papers studying the performance char- 
acteristics of polar codes (see for example [2]-[8]). However, 
the research on polar codes, to a great extent, has been limited 
to the theoretical issues. In this paper, we will look at two 
important practical problems that can be addressed using polar 
codes: concatenated codes, and rate-compatible codes. As it 
is shown, polar codes prove to be very powerful in both 
applications. In addition to the capacity-achieving capability, 
polar codes have several other interesting properties from the 
practical point of view (such as having good error floor perfor- 
mance [9]). This suggests that a combination of polar coding 
with another coding scheme could eliminate shortcomings of 
both, and provide a powerful coding paradigm. In this paper, 
we consider the design of polar code-based concatenated 
coding schemes that can contribute to closing the gap to 
the capacity. Since its introduction by Dave Forney in 1966, 
concatenated coding has been studied extensively for different 
combinations of coding schemes and different methods of 
concatenations have been proposed. Furthermore, there have 
been many applications, such as deep space communications, 
magnetic recording channels, and optical transport systems 
that use a concatenated coding scheme [10]-[13]. A coding 
scheme employed in these applications needs to show strong 
error correction capability. In optical fiber communications, a 
minimum bit error rate (BER) of at least 10^^^ is generally 
required [12], [13]. Here, we investigate the potentials of 
using polar codes in a concatenated scheme to achieve such 
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low error rates without error floor. The problem of designing 
concatenated schemes using polar codes is yet to be studied. 
The only previous work on this topic is [7] where the authors 
study the classical idea of code concatenation using short 
polar codes as inner codes and a high-rate Reed-Solomon (RS) 
code as the outer code. It is shown that such a concatenation 
scheme with a careful choice of parameters boosts the rate 
of decay of the probability of error to almost exponential 
in the block-length with essentially no loss in computational 
complexity. While [7] mainly considers the asymptotic case, 
we are interested in improving the performance in practical 
finite lengths. 

In this paper, we study practical concatenation schemes that 
achieve a performance close to capacity, while keeping a low 
complexity and avoiding error floor. We study the combination 
of polar codes with LDPC codes, assuming polar codes as the 
outer code and LDPC codes as the inner code. Comparing our 
proposed scheme against conventional concatenated schemes 
which employ a concatenation of RS, BCH, or LDPC codes 
we will show that polar-LDPC combination can actually 
outperform existing schemes. Our results suggest that polar 
codes have a great potential to be used in combination with 
other codes in real-world communication systems. 

Rate-compatibility over time-varying channels is another 
important practical issue where error-correction codes are re- 
quired to be flexible with respect to their code rates depending 
on the current channel state. In this paper, we study polar codes 
for rate-compatible applications. We first see that polar codes 
are inherently well-suited for rate-compatible applications. We 
present a simple rate-compatible scheme that can universally 
achieve the channel capacity for a set of channels, using the 
same encoder and decoder. We will then study puncturing to 
design rate-compatible polar codes. Puncturing is widely used 
in the literature to generate rate-compatible codes [14]-[16]. 
We will investigate the performance of random puncturing and 
stopping-tree puncturing (explained later) as used for polar 
codes and compare them to the universally capacity-achieving 
scheme. The rest of this paper is organized as follows. We first 
explain the notations and provide a short background on polar 
codes. Then, we study Concatenated polar codes in Section 
imi Rate-compatible polar codes are studied in Section |IV] 
Finally, Section |V] concludes the paper. 

II. Preliminaries 

In this section, we explain some notations and preliminary 
concepts we will be using in our analysis. The construction of 
polar codes is based on the following observation: Let F = 
[ } " ] . Apply the transform i^®" (where (S^n denotes the nth 
Kronecker power) to a block of A^ = 2" bits and transmit the 
output through independent copies of a symmetric BDMC, call 
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Fig. 1. Normal realization of the encoding graph for N = 8. An example 
of the stopping set is shown with black variable and check nodes. 



it W. As n grows large, the channels seen by individual bits 
(suitably defined in [1]) start polarizing: they approach either 
a noiseless channel or a pure-noise channel, where the fraction 
of channels becoming noiseless is close to the capacity I{W). 
The channel polarization phenomenon suggests to use the 
noiseless channels for transmitting information while fixing 
the symbols transmitted through the noisy ones to a value 
known both to the sender and receiver. Since the fraction 
of channels becoming noiseless tends to I{W), this scheme 
achieves the capacity of the channel. In the following, let 
u = (ui,...,ujv), X = (a;i,...,a;Ar), and y ^ (j/i, ..., ^at) 
denote, respectively, the vectors of input bits, code-bits, and 
channel output bits. We denote the channels seen by input bits 
by T/^(*), z = 1, 2, ..., N and call them bit-channels. 

The generator matrix of polar codes, denoted by Gn, is 
a sub-matrix of the Kronecker product F®" in which only 
a subset of rows of F®" are present. An equivalent way of 
expressing this is to say that the codewords are of the form 
X = uF'^'^, where the components Ui of u corresponding to 
the "frozen bits" are fixed to and the remaining components 
contain the "information". Information bits, according to [1], 
are chosen to be the ones corresponding to the best (least 
noisy) polarized channels. We denote the set of information 
bits by An{W). 

A Successive Cancelation (SC) decoding scheme is em- 
ployed in [1] to prove the capacity-achieving property of the 
code. However, [2] and [3] later employed belief propagation 
decoding to obtain better BER performance while keeping the 
decoding complexity at 0(A^log A^). Belief propagation can 
be run on the tanner graph of the code, easily obtained by 
adding check nodes to the encoding graph of polar codes; an 
example of such a graph is shown in Fig. [T]for a code of length 
8. We use a belief propagation decoder for the simulations 
throughout this paper. 

In a rate-compatible coding scheme, our goal is to provide 
reliable transmission over a set of channels with parameters 
0^,j = 1,...,J, using the same encoder and decoder Let 
Wj and c{d^) denote a channel with parameter 9^ and its 
capacity, respectively. Assume that 9^ < 9^ and c{9^) > c{9^) 
for i > j. We call a rate-compatible scheme universally 
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Fig. 2. Block diagram of the proposed concatenated system of polar and 
LDPC codes. 



capacity achieving (UCA), if the sequences of codes generated 
according to that scheme achieve the channel capacity c{9^) 
for j = 1, .., J. Assume that if 9^ < 9^ then Wj is a degraded 
version of Wi, which we denote by Wj :< Wi. We may also 
assume that the channel state information is available at the 
transmitter so that the appropriate code rate can be chosen for 
communication. 

III. Concatenated Polar Codes 

In this section, we study concatenated polar codes. Our 
proposed scheme is formed of a Polar code as the outer code, 
and a LDPC code as the inner code. Fig. |2] shows the block 
diagram of this scheme. We consider long powerful LDPC 
codes as the inner code with rates close to the channel capacity. 
LDPC codes can be decoded in linear time using BP, and at the 
same time can get very close to the capacity. However, LDPC 
codes with good waterfall characteristics are known to mostly 
suffer from the error floor problem. Here, polar codes come to 
play their role. In [9], we studied the error floor performance 
of polar codes in finite length. As it is shown in [9], polar 
codes show very good error floor performance. Furthermore, 
the following theorem is proved in [9] about the girth of the 
tanner graph of polar codes. 

Theorem 1: Any polar code of length 8 or more has a girth 
of at least 12. 

Theorem [T] shows a natural advantage of polar codes over 
LDPC codes. In fact, a girth of 12 or more is considered so 
desirable for LDPC codes that many techniques have been 
proposed in the literature to guarantee such girths (see for 
example [17] and references therein). Such a large girth also 
contributes to the good error floor performance. Based on the 
facts mentioned above, the combination of polar and LDPC 
codes is expected to yield to a powerful concatenated code 
with BER performance close to the capacity for a broad range 
of the channel parameter Here we consider binary polar codes 
as the outer code concatenated with a binary LDPC code as the 
inner code. This is different from the traditional concatenated 
schemes [18] in which a non-binary code is usually used as 
the outer code. 

In order to investigate the performance of our proposed 
scheme, we consider a real-world application which uses a 
concatenated coding scheme as its essential part. An Optical 
Transport Network (OTN) is a set of optical network elements 
connected by optical fibre links, able to transport client signals 
at data rates as high as 100 Gbit/s and beyond. These net- 
works are standardized under ITU-T Recommendation G.709 



and are an important part of the high data-rate transmission 
systems such as Gigabit Ethernet and the intercontinental 
communication network. OTU4 is the standard designed to 
transport a 100 Gigabit Ethernet signal. The EEC in the 
standard OTU4 employs a block interleaving of 16 words of 
the (255,239,17) Reed-Solomon codes, resulting in an overall 
overhead of 7%. This scheme guarantees an error floor-free 
performance at least down to BERs of 10~^^, and provides a 
coding gain of 5.8 dB at a BER of 10"^'^. Since the approval 
of this standard (February 2001), several concatenated coding 
schemes have been proposed in the literature and some as 
patents, targeting to improve the performance of this standard. 
In most cases, these schemes propose a concatenation of two 
of Reed-Solomon, LDPC, and BCH codes [11]-[13], [19]. 
Here, for the first time, we consider polar-LDPC concatenation 
for the OTU4 setting. 

A. Encoder 

In order to satisfy the overhead of 7%, we adopt an effective 
code rate of 0.93. That is if we denote the code-rates for 
the polar and LDPC codes by Rp and Ri respectively, then 
Reff = Rp ^ Ri needs to be 0.93. The first problem is to 
find the optimal code-rate combination for the two codes to 
achieve the best BER performance. While this is an interesting 
analytical problem, it might be a difficult problem to solve. 
Therefore, we will find the best rate combination empirically. 
First, note that both Rp and Ri are greater than 0.93. We 
are also aware of the relatively poor error rate performance of 
finite-length polar codes compared to LDPC codes. Therefore, 
in order to minimize the rate loss, we choose Ri close to 
the Reff- As a result, Rp would be close to 1. Now, we 
can use simulations to examine different code rates for Ri, 
each slightly above Reff, to find the best choice. Fig. [3] 
shows the BER performance of three different rate couples, 
as a sample of all the rate couples we simulated. Code- 
length for the polar code is fixed to 2^^ for all the rate 
couples. Showing a rate couple by {Rp,Ri), these three rate 
couples are (0.989,0.94), (0.979,0.95), (0.969,0.96). We picked 
(0.979,0.95) for the rest of our simulations in this paper as it 
shows a better performance in the low-error-rate region and 
this is important for our application here. As it can be seen 
in the figure, choosing the rate couple can be a compromise 
for any specific application. Fixing the code-length 2^^ for 
the polar code and fixing the rates to (0.979,0.95), the LDPC 
code-length would be 68985. We used the following optimal 
degree distribution pair which has a theoretical threshold of 
0.435: 

X{x) = 0.502197a; + 0.375436a;2 + 0.061422a;3 + 0.060945x^ 

and 

pix) = 0.076592cc^'' + 0.923408x^^ 

B. Decoder 

At the decoder side, we perform belief propagation decoding 
with soft-decision for both the polar and LDPC codes. Upon 
finishing its decoding, the LDPC decoder will pass its output 
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Fig. 3. BER performance comparison for different rate combinations in a 
polar-LDPC concatenated sclieme. 



vector of LLRs to the polar decoder. Polar decoder then treats 
this vector as the input for its belief propagation process. 



C Simulation Results 

Fig. ID depicts the BER performance for the concatenated 
scheme explained above, when using the LDPC code above. 
For the channel, we assumed a binary symmetric gaussian 
channel as it is used by [11]-[13], [19]. Along with the 
concatenated scheme, we have shown the performance of the 
LDPC code and the polar code when used alone with an 
effective rate of 0.93, which is equal to the effective rate 
of the concatenated scheme. As it can be seen, while the 
concatenated scheme shows a significant improvement over 
the polar code, it follows the performance of LDPC code 
in the waterfall region closely. Since both polar and LDPC 
codes here are capacity-achieving (or capacity-approaching), 
the concatenation does not suffer from rate loss in theory. 
Therefore, by increasing the code-length we expect that the 
curve for polar-LDPC scheme close the gap to the capacity. 
The curve also shows no sign of error floor down to BERs of 
10~^^, as opposed to the curve for the LDPC-only scheme. In 
a polar-LDPC concatenation, the two codes are orchestrated to 
cover for each other's shortcomings as follows: LDPC plays 
the dominant role in its waterfall region, while polar code is 
dominant in the error floor region of the LDPC code. 

To see the significant potential of polar codes for con- 
catenated schemes, we also showed the BER performance 
corresponding to the G.709 standard explained earlier in the 
paper, as well as the scheme proposed in a recent patent 
[19]. Both systems use a code rate of 0.93. The scheme in 
[19] employs a RS-BCH concatenated code with an overall 
minimum distance of 252. The code-length for this scheme is 
65280 which is very close to our proposed scheme. As it can 
be seen, at a BER of 10^^^, polar-LDPC has an edge of 1.45 
dB in SNR over the RS-BCH. An improvement of 2.9 dB over 
the G.709 standard can also be observed at this BER. 
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Fig. 4. BER performance for different concatenated scliemes. 



IV. Rate-Compatible Polar Codes 

In this section, we study two rate-compatible approaches 
to polar codes: universally capacity achieving rate-compatible 
polar codes, and punctured polar codes. 

A. Universally Capacity Achieving Rate-Compatible Polar 
Codes 

Universally capacity achieving rate-compatible (UCARC) 
polar codes can achieve the channel capacity for a set of 
channels, using a low complexity encoder UCARC polar 
codes can be explored using the following Lemma from [20] 
(Lemma 4.7). 

Lemma 1: Let W and W be two symmetric BDMCs such 
that ly is a degraded version of W. Then, M^(*) is degraded 
with respect to W^'^^ for i = 1, ...,N. 

This lemma implies that in a physically degraded setting, an 
order of polarization is maintained in the sense that "good" 
bits for the degraded channel, must also be "good" for the 
better channel. As a result, the set of information bits for the 
degraded channel is a subset of the set of information bits for 
the better channel, i.e. An{W) C AnCW). 

Corollary 1: Let Wj, j = 1, ..., J be a set of symmetric 
BDMC channels such that Wi ^ W2 ^ ... ^ Wj. Suppose 
that An{Wj) is known for j = 1, ..., J. Then, for any i and 
j such that Wj :< Wi, the capacity achieving polar code for 
Wj can be obtained from the polar code designed for Wi, by 
setting the input bits in AN{Wi) \ An{Wj) to zero in the 
encoder 

This means that to implement different rates, the encoder 
only needs to shrink its set of information bits by switching 
a few of them to zero. This leads to a simple and practical 
structure for the UCARC polar codes. This can be considered 
as an important advantage of polar codes over other coding 
schemes such as LDPC and turbo codes for which finding a 
UCARC scheme can be very complicated if even possible. Fig. 
13 a) shows the structure of encoder for a UCARC polar code 
of length N=8. As it is shown, the input bits can be switched 
by the encoder to operate either as an information bit or a 
frozen bit. 
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Fig. 5. Different schemes for rate-compatible polar codes, (a) UCARC polar 
codes, (b) Punctured rate-compatible polar codes. 



B. Puncturing for Rate-Compatible Polar Codes 

In this section, we consider puncturing for rate-compatible 
polar codes. Figure [S^b) shows the encoder structure for the 
punctured rate-compatible polar codes. In this scheme, a parent 
code is designed for the worst channel (with largest channel 
parameter). In order to generate codes with higher rates for 
better channels, the encoder punctures some of the output bits. 
For every channel parameter 0^ , j = 1 , . . . , J, a puncturing 
pattern is determined off-line and loaded into the encoder. 
The punctured bits will not be sent over the channel. In the 
decoder side, the log-likelihood ratios for these bits will be 
set to zero before running belief propagation. The optimal 
puncturing pattern for each rate and a specific tanner graph can 
be found using optimization techniques [14], [15]; however, it 
turns out it is difficult to use such techniques for polar codes. 

1) Random Puncturing: A simple way of puncturing, which 
is studied in some papers, is to have the encoder choose the 
punctured bits for each rate randomly. Random puncturing is 
actually proved to be a UCARC scheme for LDPC codes 
over the BEC [16]. Fig. |6] shows the gap-to-capacity (for 
the gaussian channel) for randomly punctured polar codes 
compared against the UCARC polar codes described in section 
IIV-AI As it can be seen in the figure, there is a substantial 
distance between the two curves. 

2) Stopping-Tree Puncturing for Polar Codes: Here, we 
propose an algorithm involving the stopping sets in the tanner 
graph to improve the performance of puncturing. A stopping 
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Fig. 6. Gap-to-capacity for different rate-compatible schemes over the 
Gaussian channel. Parent-code rate for punctured codes is 1/2, parent code- 
length is 2^^, and BER is fixed to 10"". 

set in the tanner graph is defined as a set of variable nodes such 
that every neighboring check node of the set is connected to at 
least two variable nodes in the set. Fig. [T] shows an example 
of the stopping set in the polar codes' graph. Stopping sets 
play an important role in the bit error rate and error floor 
performance of the code [21]. A stopping tree in the polar 
codes' tanner graph is a stopping set shaped as a tree rooted 
at an information bit (on the left side of graph) and with leaves 
at the code-bits (on the right side of graph), such as the one 
shown in Fig. [T] with black variable and check nodes. We 
refer to such a tree as the stopping tree of an information bit. 
Stopping trees in fact form an important group of the stopping 
sets. We have shown in [9] that a minimum stopping tree is 
a minimum stopping set as well. For more on the structure 
of stopping sets and the importance of the stopping trees in 
belief propagation decoding of polar codes we refer the reader 
to [9]. 

For any code-bit in the tanner graph, we can find the number 
of stopping trees that have that specific code-bit as a leaf node. 
Then, we pick the punctured code-bits from the ones which are 
present in the fewest number of stopping trees. This algorithm 
is based on the empirical results which show that the chance of 
recovery for these code-bits is higher than others in the case 
erasure. In other words, these code-bits are better protected 
than others in the tanner graph. Since the information bits are 
known and the graph has a simple structure, we can easily find 
these bits. We call this algorithm Stopping-Tree Puncturing. 
As an example of this algorithm, suppose that we want to 
puncture the parent code of rate 1/2 in Fig. |3b) to a code 
of rate 3/4. Then we need to pick 2 code-bits to puncture. If 
we look at Fig. [T] we find out that xg is the only bit that is 
present in only one stopping tree. Among the code-bits which 
are present in two stopping trees we can choose .Ty. 

Fig. |6] shows the simulation results for stopping-tree punc- 
turing compared to other techniques. As it can be seen, the 
gap-to-capacity has improved over the random puncturing 
though the distance to the UCARC scheme is still noticeable. 

V. Conclusion 

We considered concatenated coding using polar codes. We 
proposed a polar-LDPC scheme and showed, via simulations. 



that it can result in considerable improvement over the existing 
concatenated schemes. We also studied different approaches to 
rate-compatible polar codes over a set of physically degraded 
channels. We showed that UCARC polar codes can be de- 
signed with low complexity using the inherent characteristics 
of polar codes. We also studied the use of puncturing to 
generate rate-compatible polar codes. 
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